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ABSTRACT 



A method of text entry into a device by activating keys of a 
keypad (16), where a key represents various characters. 
Input is delivered to a processor (30) corresponding to the 
keys activated. A dictionary (33) is searched for candidate 
combinations of characters corresponding to the keys acti- 
vated. The candidate combinations are rank ordered, for 
example using language model data and feedback (14, 38) is 
provided to a user indicating at least a highest rai^ ordered 
candidate combination. 

19 Claims, 6 Drawing Sheets 
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METHOD AND DEVICE FOR INPUT OF 
TEXT MESSAGES FROM A KEYPAD 

HELD OF THE INVENTION 

This invention relates generally to a data entry method 
and apparatus, and more particularly to a method and 
apparatus for inputting text messages from a keypad, for 
example a numeric keypad. 

BACKGROUND OF THE INVENTION 

With the increasing desire to carry smaller and smaller 
personal communicating devices with optimum functional- 
ity such as pagers, cellular phones, and other personal 
communicator devices, and the desire to automate homes 
through combinations of telecommunications and cable 
where smart rooms will have one or more small devices that 
operate a variety of appliances or control a variety of 
applications; the need and desire to enter alphanumeric text 
through non-alpha or numeric keypad is ever increasing. 

In the United States of America it is standard for every 
telephone keypad to have letters as well as numerals dis- 
played on the keys. Thus, the key representing the numeral 
"2" has the letters A, B and C displayed and the key 
representing the numeral "3" has the letters D, E and F etc. 
It is common to represent a telephone number by the letters 
which correspond to the digits on the keypad. Thus, for 
example, a railway company with the name "Amtrack" can 
acquire the telephone number 268 7225 and can advertise to 
customers that the company's telephone number can be 
dialed by activating the keys with tiie letters A-M-T-R-A- 
C*K. Dialing these letters will result in only one digital 
sequence, but one digital sequence can be represented by 
many letter sequences. 

In the telecommunications field, methods of name selec- 
tion are known which use a numeric keypad. For example, 
it is known in a voice mail system to select a user's voice 
mailbox by keying in from a remote telephone the first few 
letters of a user's name, as a result of which a short digit 
stream is delivered to the voice maU system. The voice mail 
system compares the input digit stream with a limited set of 
options in a look-up table. SufiScient digits must be entered 
to uniquely identify the desired user. Similarly, automated 
customer service systems can use this form of text entry. For 
example an airline information system can request the first 
three letters of a destination airport and. notwithstanding 
ambiguities in the letters to be represented by the digits 
delivered, it can uniquely identify an airport from only three 
digits because it has a limited ^'dictionary" of airport names 
from which to seek a match. The same approach can be used 
in a telephone speed-dial look up arrangement. A name can 
be selected from a limited list of names stored in a telephone 
terminal and a number can be dialed associated with the 
name selected. 

Various other approaches for entering alphanumeric input 
through a non-alpha, numeric only, or other type of keypad 
having fewer keys or buttons than the corresponding alpha- 
bet have been developed or are known. Many of these 
methods take advantage of the letters displayed on the 
standard US telephone keypad. However, often the methods 
associated with the alphanumeric text through the keypad 
are cumbersome and difficult to edit. In addition, typically 
the entry must be restricted or entered in a specific manner 
such that there is only one option per sequence of input. 
More specifically, there is a direct correlation for a specific 
input to a specific letter of the alphabet or a symbol. For 
example, if a user wants to input the word "call" through a 
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non-alpha keyboard, the user has to enter a specific combi- 
nation of numbers and/or symbols to indicate the word call. 
By way of example for a user to enter the word call the user 
must enter the following: 2**'; 2*; 5***; 5***; #. Such a 

5 method of inputting is very restrictive, lime consuming, and 
editing is difficult and slow. 

Therefore a need exists to provide a user with a simple 
straightforward method and apparatus for inputting alpha- 
numeric text through a non-alpha, numeric, or other keypad 

10 having fewer keys than the desired alphabet; and the need 
exists to provide a system that is not restrictive to direct 
correlation or a specialized domain, i.e. where a variety of 
choices are available and easily input. 

Various preferred embodiments of the invention are now 
described, by way of example only, with reference to the 
accompanying drawing^. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 FIG. 1 is a front view of an illustrative text entry device 
suitable to receive input in accordance with the present 
invention. 

FIG. 2 is a block diagram detailing internal circuitry of the 
device of FIG. 1. 
2^ FIG. 3 is a state diagram illustrating operation of the 
device of FIG. 1 in accordance with a first embodiment of 
the invention. 

FIG. 4 is a flow diagram illustrating operation of the 
device of FIG. 1 in a principal entry mode of FIG. 3. 

FIG. 5 is a flow diagram illustrating further details of the 
operation of FIG. 4. 

FIG. 6 is a state diagram illustrating operation of the 
device of FIG. 1 in accordance with an alternative embodi- 
35 ment of the invention. 

FIG. 7 is an example of nine pinyin entries with corre- 
sponding digit streams and examples of selected Chinese 
characters which can be entered by operation of the alter- 
native embodiment of the invention. 
^0 FIG. 8 is a state diagram alternative to that of FIG. 6, 

DETAILED DESCRIPTION OF PREFERRED 
EMBODIMENTS 

Generally, a method is now described comprising the 
steps of: comparing input with dictionary and language 
model data stored in memory to generate at least one entry 
selected by probability of corresponding to the input; and 
providing user feedback of the entry having a likelihood of 
corresponding to the input. 

Particularly, a method of text entry inU) a device is 
provided comprising the steps of: activating keys of a 
keypad, where a key represents a plurality of characters; 
delivering input to a processor corresponding to the keys 
55 activated; searching a dictionary for candidate combinations 
of characters corresponding to the keys activated; rank 
ordering the candidate combinations; and providing feed- 
back to a user indicating at least a highest rank ordered 
candidate combination. Here, "character" refers generally to 
letters or to pictographic or grapheme characters. 

It may be noted that the dictionary is searchable by 
combinations of characters, where the characters by which 
it is searchable are the same as the characters represented on 
the keys. 

65 A method and apparatus are also described for input of 
text through a keypad having fewer keys than the number in 
the desired alphabet alone or in combination with numbers 
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and other symbols. Recognition of the input is preferably 
correlated with the most likely text selection from a 
dictionary, language, or compilation of data having an 
unrestricted set of characters. 

Language model data is preferably used to rank order the 
candidate combinations in order of probability, i.e. likeli- 
hood of usage in a given language. Such language model 
data preferably comprises character combination data. The 
character combination data comprises, for example, word 
unigram data, letter trigram data, or word bigram or trigram 
(or n-gram) data or, for example, in a Chinese or other 
pictographic or grapheme language version to be described, 
it may comprise pinyin letter trigram data, pictographic or 
grapheme character unigram data, pictographic or grapheme 
character bigram (or n-gram) data or word (i.e. pictographic 
or grapheme character combination) bigram (or n-gram) 
data and the like. 

In a preferred embodiment, at least one likely entry can be 
displayed in a display area of a device and the at least one 
likely entry presented in the display area can be edited 
through input of a keypad. 

Referring now to the figures, FIG. 1 illustrates a general 
and preferred embodiment of apparatus in accordance with 
the present invention. An example of a cellular telephone (or 
other data enUy device such as a wireline telephone, pager 
or personal digital assistant or telecommunications device) 
having a keypad is illustrated as generally depicted by the 
reference numeral 10. The cellular telephone 10 depicted 
comprises a housing 12, a display 14 and a keypad 16 
^ through w hich input is received. The display 14 has a text 
djsplay area IS iind an area 17 for displaying word, letter 
combination or character alternatives. The keypad 1 6 has 
^ jweb rfr keys j^h digits Q-9 displayed thereon in a siandarg 
^layo\i t. Aly^ difipTayaa m a stanaard l ayout ar e letters of the 
Roman alphabet A-Z. These ieuers can oe^rranged m other 
arrangements, ror example for greater ergonomic efficiency, 
but the illustrated layout is preferred on account of estab- 
lished use. The key bearing the digit "1"' has the punctuation 
marks and displayed thereon. The low^er left 

hand key 18 has the symbols (meaning "backspace" or 
"scroll left*' as is explained below) and * (referred to as 
"star"). The lower right hand key 19 has the symbols 
(meaning "enter" or "scroll right" as is explained below) and 
"#" (referred to as "pound"). 

In the preferred embodiment illustrated, the English 
alphabet is used. Alternatively, various other alphabets may 
be used in accordance with the preferred embodiments of the 



15 



entry device, English words and letter trigrams. The lan- 
guage model data 34 includes unigram weight values for the 
words and letter trigrams stored in the dictionary. Optionally 
the data 34 also includes word bigram and even word 
trigram data. Other langu&ge model information can be 
stored with unigram weight values 34. Ad optional loud- 
speaker 38 is shown coupled to the microprocessor. 

Referring to FIG. 3, the microprocessor 18 is shown as 
having three principal modes of operation for data entry (it 
may have many other modes and functions not relevant to 
the present invention). The first principal mode of data entry 
is standard entry mode 40 and associated with this mode is 
cycle mode 42. The other modes of data entry are numeric 
entry mode 46 and supervised entry mode 48. Standard entry 
mode 40 is now described in detail, with the other modes 
being described later. 

Referring now to FIG. 4, a flow diagram of a preferred 
embodiment of recognizing alphanumeric input in standard 
mode through a non-alpha or numeric keypad is illustrated. 
20 The example will be considered where the digit sequence 2 
2 5 5 is entered. As illustrated, the first input digit "2" is 
received in step 120 by pressing briefly and releasing the 
corresponding key . Subroutine 130 will be disregarded for 
the present as it relates only to entry of "#»" and "«*". In 
25 step 140 the mmieric sequence entered so far is sent on to the 
next step for comparison against the contents of the dictio- 
nary 33. Thus, each entry is appended by microprocessor 30, 
as it is received, with previously entered digits and the 
various possible corresponding letters are compared in step 
,30 150 with words from the dictionary 33. 

Thefirst digit "2 " c ao^ correspond to A, B or C . The 
comparison of step 150^ will search tor words and 
(optimally) letter trigrams beginning with the letters A, B or 
C. In the case of the first entered digit, all three options (A, 
B and C) are possible first letters of a word and step 150 
determines positively that at least one match is found. (Note 
that if step 150 reveals no candidate words then step 155 
instead looks for candidate letter trigrams). In step 160 all 
possible matches correlating to the input are identified and 
kept active for further steps in the process. In step 170, 
probabilities are assigned to the active matches using the 
language modeling data 34 and in step 180 the active match 
or matches with the highest probability are displayed in 
display area 17. In the case where only one digit has so far 
been entered, there is suflBcient space in display area 17 to 
display all three active matches A, B and C. In step 200 the 
program returns to step 120 and awaits the next digit input. 
When a second input "2" is received, the comparison 150 
present invention, such as Chinese phonetic alphabets or will now yield words and trigrams beginning with AA, AB, 
Russian or Asian alphabets. In the preferred embodiment of 50 AC, BA, BB, BC, CA, CB, and CC. If a third input, 5, is 
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the present invention illustrated in the figures, the input 
device is a keypad having fewer individual keys than the 
number of letters in the alphabet. Therefore, the alphabet is 
separated into various sequences where a single numeral and 
a series of letters or symbols are represented on a single key, 
or block of input, of the keypad or other input device. 
Pressing of a key delivers a digit, but Ihr digit gnn rfprcnyjlt 

Referring to FIG. 2. the cellular telephone 10 is illustrated 
^jas-bavipg-"ft^ciici; Qprocessor 30 couplc xUoJLh e^input pad 16 
and to the display 14 using standard input andouipul drivers 
as are known in the art. Also coupled to the microprocessor 
30 are a first memory 31 which is preferably electrically- 
erasable iea3^^iny'"5ieraory^EEPROM) and a second 
memory 32 which is preferably random access memory 
(RAM). In the EEPROM memory is stored a dictionary 33. 
The dictionary 33 includes, for an English language data 



received, the comparison 150 could potentially yield entries 
beginning with AAJ, AAK, AAL, ABJ, ABK, ABL, BAJ, 
BAK, BAL, BBJ, BBK, BBL, BO, BCK, BCL, CAJ, CAK, 
CAL, CBJ, CBK, CBL, CCJ, CCK, and CCL, but some of 
55 these trigrams may not appear in the English language and 
will be omitted by step 150. If a fourth input, 5, is received, 
words beginning with AAJJ, AAJK, AAJL, AAKJ, AAKK, 
AAKL, AAU, AALK, AALL, ABJJ, ABJK, ABJL, ABKJ, 
ABKK, ABKL, ABU, ABLK, ABU., BAJJ, BAJK, BAJL, 
BAKJ, BAKK, BAKL, BAU, BALK, BALL, BBJJ, BBJK, 
BBJL, BBKJ, BBKK, BBKL, BBU, BBLK, BBLL, BOJ, 
BCJK, BCJL, BCKJ, BCKK, BCKL, BCU, BCLK, BCLL, 
CAJJ, CAJK, CAJL, CAKJ, CAKK, CAKL, CAU, CALK, 
CALL, CBJJ, CBJK, CBJL, CBKJ, CBKK, CBKL, CBU, 
CBLK, CBLL, CCJJ, CQK, CCJL, CCKJ, CCKK, CCKL, 
and CCLJ, CCLK, and CCLL are possibilities to be searched 
in the dictionary (less any combinations ruled out in the 
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previous rouad of comparison). From this list of all with a large number of alternatives is ergpnomically ineffi- 

combinations, the dictionary is compared and any matches cient. For these reasons, a language model is used in step 

are identified in step 150. In the example presented, the 170 to assign a probability to each of the matched entries 

entries with matches found in the dictionary arc BALK, identifying the likelihood that the entry corresponds to the 
BALL, CALK, and CALL. S intended input. Preferably, step 170 causes the entry with the 

Alphanumeric recognition data can be stored in the highest probability of correlating lo the alphanumeric input 

memory 31. to be displayed in area 17 of the display 14 of the device 10. 

In the preferred embodiment, the dictionary 33 represents Th^^ is achieved by looking up and comparing unigram 

a plurality of words or symbols and is preferably stored as weighting values from language model data 34 (and/or other 

a trie, as is described in "Fundamentals of Data Structures in language model data) for the different candidate alternatives 

Pascal" by E. Horowitz and S. Sahni, pubUshed by Com- selecting the most likely candidates (e.g. four candidates 

puter Science Press, pages 512-520. The input is compared so°ie other number up to, preferably, a maximum of 10 

against the dictionary and preferably as each input is candidates). 

received the comparison is continually updated. In a pre- An advantage of the present invention is that it allows for 
ferred embodiment, a comparison that will not yield a match ambiguous input and converts that input to a word or 
is eliminated and that combination and any and all further alphanumeric value that has the highest probability or like- 
combinations that include that combination are also then lihood of correlating to the input. 

ignored or eliminated. For example purposes only, if a first Referring now to FIG. 5, details of the subroutine 130 of 

and a second input are 2 2, and the comparison with the FIG. 4 are shown. 

dictionary yields BB, and the dictionary does not have any if, following entry of one or more digits, and display of 

entries that would match BB, all further input that would the candidate matches on display area 17, key 19 is pressed, 

begin with BB combinations is eliminated or ignored. For bearing the markings "#»" the program proceeds to step 

example, if the third is 5, such that the input now is 2 2 5, 184. Step 184 represents the commencement of an end-of- 

all the entries such as BBJ, BBK, BBL, etc. are ignored or word mode, terminating the entry of the current word and 

eliminated and not compared to the dictionary. causing it to be displayed in text display area 15 followed by 

In accordance with a preferred embodiment of the present a space, 

invention, a breadth-first search is used to accelerated the ^ this point in the process, the highest priority match is 

comparison of the input to the stored dictionary entries. displayed in alternatives display area 17 of display 14. The 

Advantages of this type of search are that it reduces the time program awaits new date entry at step 190. If the word or 

for comparison and provides results with a greater accuracy character displayed in display area 17 (the highest ranked 

of correlating to the input. Breadth-first searches are match) is not the desired word or character— e.g. if CALL 

described in "Heuristics— Intelligent Search Strategies for is displayed, but BALL is desired, pressing (briefly) and 

Computer Problem Solving" by Judea Pearl, published by releasing key 19 (#») causes step 193 to cycle to the next 

Addison-Wesley Publishing Company, page 42 et seq. and choice. With each press of key 19, steps 192 and 193 cause 

"Introduction to Algorithms" by Thomas H. Cormen et al., the next rankordered choice to be displayed in alternatives 

published by M.l.T. Press and McGraw-Hill at page 469 et display area 17 and in text display area 15 (eventually 

seq. returning to the original highest rank-ordered choice). If, in 

In a preferred embodiment of the present invention, the step 190, the input key 18 («*) is pressed, step 198 causes 
dictionary whose contents are stored as a trie is seardied ^ the entire sequence of digits since the last space (the entire 

using a beam search. Various techniques for beam searches word or character) to be deleted and the program recom- 

are known in the art. For present purposes, a beam search is mences at step 120, where a new input can be received and 

a breadth-first search with limited bread±, where only, say the word can be re-entered. If the entry at step 190 is neither 

100 possible matches are considered at any moment. This of keys 18 and 19 then logically it must be another of the 

feature is particularly advantageous because spelling errors keys of keypad 16, representing a digit 0-9 and step 197 is 

are permitted. Preferably, a restriction is established limiting reached. This entry is interpreted lo be the start of a new 

the number of alternatives that will be searched. word or character. In step 197 the end-of-word mode ends 

More preferably, a restriction is established which and a new word (or character) begins. The program proceeds 

restricts or eliminates all alternatives from the dictionary to step 140 for processing of the new digit stream repre- 
that are returned which fall below a predetermined "score," 50 renting the new word or character. 

The score that is selected will vary depending upon the Other steps can be introduced. For example press-and- 

parametcrs that are determined to be the most significant for hold activations of various keys during the execution of 

the application for which the invention is being used. In a steps 182-198 can be given additional meanings and func- 

preferred embodiment, an alternative receives a high score tions. 

if the entry corresponds to an entry in the dictionary. If, 55 If, following step 120, key 18 is pressed, step 186 causes 

however, the alternative does not correspond to an entry in the program to proceed lo step 188 and the last digit entered 

the dictionary, the alternative is given a low score. is deleted. The program then continues with step 140 and the 

More preferably, each alternative that does not match an numeric sequence so far entered (omitting the last digit) is 

entry in the dictionary is eliminated. Preferably, for each delivered to comparison step 150 for processing as before, 
alternative that does match an entry in the dictionary, the 60 It will, of course, be appreciated by one of ordinary skill 

alternative is compared to a imigram language model to in the art of computer programming that the steps illustrated 

determine the frequency of occurrence of that alternative. in FIGS. 4 and 5 can be implemented in many different 

In the preferred embodiment, all input that matches or forms and do not have to be implemented in the sequences 

corresponds to an entry in the dictionary at step 150 of HO. shown. 

4 is returned to the memory 32. In many instances, there are 65 If at any time in mode 40 (except between steps 184 and 

too many possible matches for all matches to be displayed 195) the desired word to be entered in the text display area 

in alternatives display area 17. Moreover, presenting a user 15 is not the highest priority word, key 19 can be pressed 



05/08/2004. EAST Version: 1.4.1 



5,952,942 

7 8 

down and held down for a predetennined time (e.g. 1.5 only letter trigrams to select the next candidate character, 

seconds) and this will ca\ise cycle mode 44 (FIG. 3) to When key 19 is pressed, supervised data entry mode ends, 

operate and the candidate matches in display area 17 will the displayed (or in the preferred alternative the highest 

cycle through (cycling to the left if more than one is priority displayed) word is stored in memory 32 and dis- 
displayed). Upon release of the respective button, standard s played in lexl display area 15 and the program remms to 

entry mode 40 resumes. Pressing-and-holding of key 19 standard entry mode 40. Supervised entry mode is particu- 

between steps 184 and 195 can be given a dififerent meaning larly useful for entry of words not included in the dictionary 

(because pressing and releasing that key at that time already 33 and for entry of names 

operates the function of cycling through the choices). It is embodiment described above, a predetermined key 
preferred that here pressmg-aod-holding key 19 has the lo (e.g. "r') is preferably to select and enter the highest rank 

effect of entering a space in text display area 15. ^^dered candidate combination. 

When the desired word is displayed in the left-most A memory (e.g. EEPROM 31 or a separate computer disk) 

position m the display area 17, it can be selected by pressmg ^.^n be provided having instructions and data stored therein 

key 19 briefly (less than the predetermined time) and releas- j^at, when executed, cause the microprocessor 30 to com- 
mg It. This acUon dehvers the word to memory 32 and to text 15 prfse: an input; a dictionary 33; language model data 34; a 

display area 15 as described above. j^^rch algorithm 150, to search the dictionary for candidate 

If the display area 17 is small in comparison to the length combinations of characters corresponding to digits received 

of the candidate matches, there may be only one candidate at the input (input from input pad 16); a rank ordering 

displayed at one time and key 19 will cause the next function 170 that rank orders the candidate combinations 
candidate (in rank order of likelihood of selection) to be 20 according to the language model data; and an output (to 
displayed. ^ display 14 or loudspeaker 38) to provide feedback indicating 

The standard mode of text entry 40 described and illus- at least a highest rank ordered candidate combination, 

trated in FIGS. 4 and 5 allows for very fast text entry from jhere now follows a description of how the above 

multiplexed keys where each key represents a plurality of embodiment of a data entry device can be modified for very 
letters or characters. For example, 65% of English text is ^5 ^i^^i^ f^^^ entry of Chinese characters. In this embodi- 

written using only 100 words. 91% of these words can be ^ent it is preferred (but not essential) that one of a pluraHty 

entered without having to use keys 18 and 19 to scroll of keys (e.g. G-9) is activated to select and enter one of a 

beyond the first presented match. Thus the arrangement plurality of rank ordered candidate combinations, 

described comes close to meeting a goal of one keystroke g-^^^ ^.^ ^^^^^^ ^ ^^^^^^^ ^^^^^^^ representation 

entry per letter. Chinese characters has been adopted known as "pinyin". 

Where the words to be entered are all included in the xhe pinyin for each Chinese character, like each English 

dictionary standard entry mode 40 (and associated modes 42 ^^^^^ made up from combinations of the 26 letters of the 

and 44) can be used continuously to enter text. ^^^^^ alphabet A-Z. 

In alternative embodiment of standard entry mode 40, a ^ ^^^^^-^ ^ascd Chinese keypad data entry device can 

bigram language model (two-word combmaUon model) is ^^^^ ^^^^ ^^^^^ ^^^^^^ ^ ^^own in HG. 1. The 

used to determine the frequency of two consecutive words ^ ^^^j^ telephone digit keys and 26 English letters 

occurrmg together and to rank order the candidate matches ^^^^^ ^ ^ (Alternatively the digits 7 through 0 

accordingly. This embodiment, however, can require a large represent <p r s>, <t u v>, <w x y>, <q z> respectively 

memory 31, especially when operating m the English or digits 7 through 9 and digit 1 can represent <p r s>, <t u 

language, on account of the very large number of possible <^ ^ respectively.) 

bigrams. A trigram language model can be used to determine n r • * nr^ ^ j j- r 

e <? *u J • . .t. Rcierrmg to FIG. 6, a mode diagram for a program 

the frequency of three words occurring together. u i-.t- uj- 

^ / . , - . i. ^ . operated by microprocessor 30 is shown for this embodi- 

In accordance with preferred features of the present ^^^^ invention. Ihcrc is a standard entry mode 300 

invention, alternaUve data entry modes are provided. ^-^^ associated character selection mode 302. Character 

Numeric entry mode 46 can be entered by pressing and selection mode has scroll left and scroll right modes 304 and 

holding a key corresponding to the digits 0-9. Upon pressmg 305 as sub-modes. An alternative alphanumeric entry mode 

and holding one of these keys, the different multiplexed provided 

characters (digit and letters) are displayed in cyclic order in j„ ^^^^^^^^ ^^^^ ^ ^^^^ ^^^^^ . j„ 

the d^play area 17. For example pressing and holding the ^^^^ Chinese character he wants to input by using the 

2 key causes the characters A-B.C.2.A.B.C.2 to be j^^^^ ^^^^ ^ ^J^^ ^^^^^ ^ 

displayed in sequence (or 2.A-B-C etc.). If the key is ^^^^ ^ ^ ^^^^ ^P^^ 

released while a numeral is displayed, numenc entry mode verification, a digit string is display«i in a smaU window 13 

46 IS activated. If it is released while a letter is displayed, ^^ich is located at the top right of the display 14). He can 

supervised entry mode 48 is acUvated. ^^^^ j^,^^^ ^ ^.^.^ ^ ^^^^^ When finished, 

Once numenc entry mode 46 is acUvated, every subse- he presses key 19. The program then proceeds to character 

quent key press is assumed lo be a number, until key 19 is selection mode 302 and displays in character selection area 

pressed. Hius the number 8211223 shown in FIG. 1 can ^ ^^^^^y the top 10 Chinese characters that have the 

easily be entered by one long press of key «8" foUowed by pj^yin entered. The characters are rank ordered based 00 the 

a short press of each of the subsequent keys. language model data 34. A number from 0 to 9 is displayed 

Upon pressing key 19 the program returns to standard under each Chinese character. The user can select the 

data entry mode 40. Chinese character he wants to input by pressing its corrc- 

When supervised data entry mode 48 is entered, each spending number displayed. For example, if the Chinese 

subsequent letter can also be entered by a similar prcss-and- character marked by 3 is the desired one, he simply presses 
scroll method of letter selection (as is known in the art). 65 the key marked by 3 nto put it in the text area 15 of the 

As a preferred alternative, supervised data entry mode 48 display 14 and store it in memory 32. The program then 

operates in the same way as standard entry mode 40 but uses returns to standard entry mode 300. 
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If the user can not find the Chinese character he wants to standard mode 300, when pressing and holding key 0 for a 

input in the display area 17, he can press key 19 again, then predetermined time (e.g. 1-5 seconds), the program enters 

mode 304 of the program gives a list of another 10 charac- the Chinese punctuation and symbol entry mode 620 and 

ters and these are displayed in display area 17. The user can displays in character selection area 17 of the display digit 0, 
revert to the previous list of 10 characters by pressing key 18 5 space, punctuation marks, and other symbols, 
to activate mode 304. Since the dictionary 33 contains every 

possible Chinese character, the user can always find the right 0, . : ! ?( )% 

character he wants to input. ^& ' 01 23456789 

If the pinyin the user entered is not a legitimate pinyin of 
a Chinese character, the system displays nothing and gives 

a beep through the loudspeaker 38 or leaves the digit stream A number from 0 to 9 is displayed under each symbol (or 

displayed unchanged in window 13 and gives a beep. item). The user can select the Chinese symbol be wants to 

Once the user has entered the pinyin followed by key input by pressing its corresponding number displayed. If the 

19, the two editor keys and become the selecting user cannot find the symbol he wants to input, he can press 

keys for the top 10 candidate list, one is forward, the other key 19, then mode 620 of the program gives a list of another 

is backward. The user roust make his selection by pressing 10 Chinese symbols and these are displayed in the display 

one of 10 telephone digit keys even if he can not find the area 17. The user can revert to the previous list of 10 

Chinese character he wants to input. This returns the opera- symbols by pressing key 18 to activate mode 620. There are 

tion to standard entry mode 40 and key 18 can be used to more than 600 Chinese symbols. 

delete the entered character and start again with pinyin entry The system uses keys 1-9 to enter ASCII symbols, which 

for the desired character. include English alphanumerics, punctuation marks, and 

An example of nine pinyin entries, with corresponding other symbols. In standard mode 300, when pressing/ 

digit streams 502 and examples of selected characters 504 holding key 1 for a predetermined time (e.g. 1-5 seconds), 

corresponding to those pinyin entries is shown in FIG. 7. the program enters mode 610 and displays digit 1, space, 

There are four frequently used punctuation marks punctuation marks, and other ASCII symbols. 
"?", marked in key 1. The system uses the press/hold 

mode to enter one of them. When the user wants to input the ^ i, ,7I-W® 

punctuation, he presses and holds key 1. The program enters ^'^ ' 0123436789 
mode 310. Then the digit 1 and the four punctuation marks 
"j", ""y are then displayed on the screen one by one. 

The user can select the punctuation he wants to input by A number from 0 to 9 is displayed under each symbol (or 

release key 1 when it is being displayed. item). The user can select the ASCII symbol he wants to 

In a manner similar to the input of punctuation, alphanu- ^V^^ by pressing its corresponding number displayed. If the 

meric entry mode 310 is entered in a press/hold manner to ^ser cannot find the symbol he wants to input, he can press 
enter digits and Roman letters. For example, if the user 35 key 19, then mode 610 of the program gives a list of another 

wants to enter the Roman character M, he presses and holds 10 ASCII symbols and these are displayed in the display area 

key 6 which is marked by M, N, 0. The system then displays 17 The user can revert to the previous list of lO.symbols by 

digit 6, the small cases of these three characters and the pressing key 18 to activate mode 610. 

capital cases of these three characters one by one. The user Other digits and Roman letters can also be entered in a 
then releases the key when he sees the character M on the 4^ press/hold manner. For example, if the user wants to enter 

screen. Roman character M, he presses and holds key 6 which is 

Referring to FIG. 8, a state diagram alternative to that of marked by M, N, O. The system then enters mode 610 from 

FIG. 6 is shown. In this diagram, standard entry mode 300 mode 300 and displays digit 6, the small and capital cases of 

as well as Chinese (or other pictographic or grapheme) English letters m, n, o, like this 
character selection mode 302 and scroll right and scroll left 45 

modes 304 and 306 arc unaltered. Mode 310 of FIG. 6 is 6iwwM/V0Exit 

split into two alternative modes in FIG. 8, these being 01234567 
Chinese punctuation and symbol mode 620 and ASCII 
symbol entry mode 610. Mode 620 is reached from mode 

300 by pressing and holding key "0". Mode 610 is reached 50 The user can then select the capital letter M by pressing key 

from mode 300 by pressing and holding one of keys "1-9". 4 ^nd the program remms mode 600. If the user accidentally 

These modes 620 and 610 have respective scroll right and passes and holds the keys 2 to 9, and he does not want to 

scoU left modes 621, 622, 611 and 612 similar to modes 306 e^er any Roman letters, he can press key 7 to return to the 

and 304. AddjtionaUy msert retuni mode 601 is provided standard mode 300 without entering any Roman letters, 
which is reached from mode 300 by pressing key 19 (#) 55 j^ode 601 allows a carriage return to be entered between 

without first entering any pinyin. Mode 302 is, in FIG. 8, ^^jds or characters. Mode 632 is the same as mode 44 in 

only reached by pressing key 19 {#) after entering pinyin. piG. 1 and mode 631 is similar. 

Modes 631 and 632 are added (scroll right and scroll left jhe arrangement of HGS. 1, 2 and 6 or 8 can be used to 

respectively) which can be reached from mode 300 by enter text using Zhuyin phonedc Chinese symbols instead of 

pressmg and holdmg respective keys 19 and 18 for a pinyin (lower case Roman) symbols, by suitably displaying 

prcdetermmed Umc (e.g. 1.5 seconds). the 37 Zhuyin phonetic Chinese symbols on 10 telephone 

Mode 620 is useful because, in Chinese, there are about keypad keys and suitably providing matching dictionary and 

500 punctuation and other symbols that do not form parts of language model data. 

words, but stand alone. These can be selected by pressing u will be apparent to those skilled in the art that the 
"0" and scroUing left or right to select the desired symbol. 65 disclosed invention may be modified in numerous ways and 

The system uses key 0 to enter Chinese punctuation marks may assume many embodiments other than the preferred 

and other Chinese symbols, such as space, slash, etc. In formsparticularly set out and described above. For example, 
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instead of providing a display 14 and user feedback through 
area 17 of the display, synthesized voice can be output 
through the speaker 38 to provide user feedback of a 
candidate match. Additionally, the input pad 16 and loud- 
speaker 38 can be remote from the microprocessor 30 and its ^ 
memories 31 and 32, providing remote data entry. This 
arrangement is particularly useful for allowing a standard 
telephone handset to be used for text entry, for example for 
entering paging messages to a paging system from a remote 
location. Accordingly, it is intended by the appended claims 
to cover all modifications of the present invention that fall 
within the true spirit and scope of the present invention and 
its equivalents. 

What is claimed is: 35 

1. A method of text entry into a device comprising the 
steps of: 

activating keys of a keypad, where a key represents a 
plurality of characters in a first alphabet; ^ 

delivering input to a processor corresponding to a set of 
plural keys activated; 

searching a dictionary for a set of alternative characters in 
a second alphabet, each alternative character ambigu- 
ously corresponding to the set of plural keys activated, 25 
to thereby provide candidate second alphabet charac- 
ters; 

rank ordering the candidate second alphabet characters 
utilizing language model data indicative of likelihood 
of usage in a given language to order the candidate 
second alphabet characters in order of probabihty; and 

providing feedback to a user indicating at least a highest 
rank ordered candidate second alphabet character. 

2. The method of claim 1, further comprising the step of 35 
activating a predetermined key to select and enter the 
highest rank ordered candidate second alphabet character. 

3. The method of claim 1, further comprising the step of 
activating one of the keys of the keypad to select and enter 
one of a plurality of rank ordered candidate second alphabet ^ 
characters. 

4. The method of claim 1, wherein the language model 
data comprises unigram data. 

5. The method of claim 1, wherein the language model 
data comprises letter Ingram data. 

6. A device comprising: 

a keypad, said keypad having an input area of discrete 
keys, each key representing plural characters in a first 
alphabet, and a display area, said input area configured 
to generate numeric input upon activation of the keys; 

display processing circuitry for presenting a graphical 
image of a second alphabet ideographic character on 
the display; 

a memory having at least alphanumeric recognition data 55 
stored therein; and 

digital processing circuitry for comparing the numeric 
input with the alphanumeric recognition data and for 
rank ordering the candidate second alphabet characters 
utilizing probability data indicative of likelihood of 60 
usage in a given language to order the candidate second 
alphabet characters in order of probability to provide to 
the display processing circuitry candidate recognition 
information including at least one likely candidate 
second alphabet ideographic character having a highest 65 
probability of corresponding to the numeric input 
value. 
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7. A method comprising the steps of: 

receiving input data from a keypad area of a device having 
a set of keys, each key representing a plurality of 
characters such that activation of a key generates an 
ambiguous input signal having a probability of repre- 
senting any one of the plurality of characters for that 
key; 

comparing the input data with a dictionary and word 
bigram language model data to identify at least one 
likely entry likely to represent the input data; and 

displaying the at least one likely entry in a display area of 
said device. 

8. A method comprising the steps of: 

comparing input data received from a keypad area of a 
device with a dictionary, the keypad area comprising a 
set of keys, each key representing a plurality of char- 
acters such that activation of a key generates an 
ambiguous input signal having a probability of repre- 
senting any one of the plurality of characters for that 
key; 

identifying candidate words from the dictionary corre- 
sponding to the input data; 

identifying, using word bigram language model data, at 
least one likely candidate word having a highest prob- 
ability of representing the input data; 

storing the at least one likely candidate word in memory; 

displaying the at least one likely candidate word in a 
display area of the device; and 

editing the at least one likely candidate word presented in 
the display area through input of the keypad area. 

9. A method of processing keypad entry into alphanu- 
meric text, comprising the steps of: 

receiving input as a sequence of numeric values; 

converting the sequence of numeric values to at least one 
alphanumeric unit; 

comparing the at least one alphanumeric unit with a 
dictionary to identify candidate words likely to repre- 
sent the input; 

assigning through word bigram data a probability to each 
of the candidate words; and 

displaying a candidate word having a highest probability. 

10. A device comprising: 

a keypad, said keypad having an input area and a display 
area, said input area configured to receive numeric 
input; 

display processing circuitry for presenting a graphical 
image on the display; 

a memory having at least alphanumeric recognition data, 
including word bigram data stored therein; and 

digital processing circuitry for comparing the numeric 
input with the alphanumeric recognition data to provide 
to the display processing circuitry candidate recogni- 
tion information including at least one likely candidate 
word having a highest probability of corresponding to 
the numeric input value, as determined by the word 
bigram data. 

U. A method of text entry into a device comprising the 
steps of: 

activating keys of a keypad, where a key represents a 
plurality of characters and activation of each key gen- 
erates an ambiguous input signal having a probability 
of representing any one of the plurality of characters for 
that key; 



05/08/2004, EAST Version: 1.4.1 



5,952,942 



13 



14 



delivering input to a processor corresponding to the keys 
activated; 

searching a dictionary for words corresponding to the 

keys activated to provide candidate words; 
rank ordering the candidate words utilizing word bigram 

language model data to order the candidate words in 

order of probability; and 
providing feedback to a user indicating at least a highest 

rank ordered candidate word. 

12. The method of claim 11, further comprising the step 
of activating a predetermined key to select and enter the 
highest rank ordered candidate word. 

13. The method of claim 11, further comprising the step 
of activating one of the keys of the keypad to select and enter 
one of a plurality of rank ordered candidate words. 

14. A text entry device comprising: a keypad having a set 
of keys, each key representing a plurality of characters such 
that activation of a key generates an ambiguous input signal 
having a probability of representing any one of the plurality 
of characters for that key; a dictionary; word bigram lan- 
guage model data; an output device; and a processor coupled 
to the keypad, dictionary, language model data and output 
device, 

the processor being programmed to receive input corre- 
sponditig to keys activated, to search for candidate 
combinations of characters corresponding to the keys 
activated, to rank order the candidate combinations 
according to the word bigram language model data and 
to provide feedback through the output device indicat- 
ing at least a highest probability rank ordered candidate 
combination. 

15. A method of processing keypad entry into alphanu- 
meric text, comprising the steps of: 

receiving input as a sequence of numeric values where 
each numeric value has a probability of representing 
any one of a plurality of characters; 

converting the sequence of numeric values to at least one 
alphanumeric unit; 

comparing the at least one alphanumeric unit with a 
dictionary to identify candidate words likely to repre- 
sent the input; 

assigning through word bigram data a probability to each 
of the candidate words; and 

displaying a candidate word having a highest probability 
data as determined by the word diagram data. 
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16. The method of claim 15, further comprising the step 
of: 

editing the candidate word having the highest probability. 

17. The method of claim 15, wherein the step of com- 
paring the at least one alphanumeric unit with a dictionary 
is completed through use of a breadth-first search. 

18. A device comprising: 

a keypad, said keypad having an input area of discrete 
keys, each key representing a plurality of characters, 
and a display area, said input area configured to receive 
numeric input such that activation of a key generates an 
ambiguous input signal having a probability of repre- 
senting any one of the plurality of characters for that 
key; 

display processing circuitry for presenting a graphical 
image on the display; 

a memory having at least alphanumeric recognition data 
stored therein; and 

digital processing circuitry for comparing the numeric 
input with the alphanumeric recognition data and for 
assigning word bigram probabilities to words repre- 
sented by the alphanumeric recognition data, to provide 
to the display processing circuitry candidate recogni- 
tion information including at least one likely candidate 
word representing having a highest probability of cor- 
responding to the numeric input value as determined by 
the word bigram probabilities. 

19. A device for text entry comprLsing: 

a memory having instructions and data stored therein that, 
when executed, cause a digital processor to comprise: 
an input; 
a dictionary 

at least word bigram language model data; 

a search algorithm, to search the dictionary for combina- 
tions of characters and words corresponding to digits 
received at the input to provide candidate combinations 
and words; 

a rank ordering function that rank orders the candidate 
combinations and words according to the word bigram 
language model data; and 

an output to provide feedback indicating at least a highest 
rank ordered candidate combination or word. 
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